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Optimality  Conditions  in 
Generalized  Geometric  Programming 
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Abstract.  Generalizations  of  the  Kuhn-Tucker  optimality  conditions 
are  given,  as  are  the  fundamental  theoremshaving  to  do  with  their 
necessity  and/or  sufficiency. 
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1.  Introduction . Optimization  problems  from  the  real  world  usually  possess 
a linear-algebraic  component,  either  directly  in  the  form  of  problem  lin- 
earities (e.g.,  those  involving  the  node-arc  incidence  matrices  in  network 
optimization)  or  indirectly  in  the  more  subtle  form  of  certain  problem 
nonlinearities  (e.g.,  those  involving  the  coefficient  matrices  in  quadratic 
programing  or,  perhaps  more  appropriately,  those  involving  the  exponent 
matrices  in  signomial  programing)  . As  demonstrated  in  the  author's  recent 
survey  paper  [l]  and  some  of  the  references  cited  therein,  such  a compon- 
ent can  frequently  be  exploited  by  taking  a (generalized)  geometric  pro- 
graming approach.  In  fact,  geometric  programing  is  primarily  a body  of 
techniques  and  theorems  for  inducing  and  exploiting  as  much  linearity 
as  possible. 

The  following  sections  present  optimality  conditions  that  are 
tailored  to  geometric  programing  and  can  be  viewed  as  generalizations  of 
the  Kuhn-Tucker  optimality  conditions.  Although  the  fundamental  theorems 
having  to  do  with  their  necessity  and/or  sufficiency  have  already  been 
described  in  [l],  proofs  are  given  here  for  the  first  time. 

In  geometric  programming,  problems  with  only  linear  constraints  are 
treated  in  essentially  the  same  way  as  problems  without  constraints.  Only 
problems  with  nonlinear  constraints  require  additional  attention  and  hence 
are  classified  as  constrained  problems. 

Since  many  important  problems  are  unconstrained  (e.g.,  most  network 
optimization  problems),  and  since  the  theory  for  the  unconstrained  case  is 
much  simpler  than  its  counterpart  for  the  constrained  case,  the  unconstrained 
case  is  treated  separately  (even  though  its  theory  is  actually  embedded  in 
the  theory  for  the  constrained  case) . 
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Ma thematically,  this  paper  is  essentially  self-contained. 

2.  The  unconstrained  case.  Given  a nonempty  cone  Z C Er  (n-dimensional 
Euclidean  space),  and  given  a function  g with  a nonempty  domain  <3=  E^,  the 
resulting  "geometric  programming  problem"  3 is  then  defined  in  the  following 
way. 

PROBLEM  d . Using  the  "feasible  solution"  set 

A 

r>=znc., 

calculate  both  the  "problem  infimum" 

co  ^ Inf  g Or) 
x € J 

and  the  "optimal  solution"  set 

A 

3*  = {jo  € J | g (x)  =o} 

Needless  to  say,  the  "ordinary  programming"  case  occurs  when  Z is  actually 
the  entire  vector  space  E^. 

Our  optimality  conditions  for  the  preceding  problem  (3  utilize  the 
"dual  cone" 

A , 

I 0 s <Jtr,2/>  for  each  jo  6 Z]. 

0 

They  are  stated  as  part  of  the  following  definition. 


DEFINITION.  A critical  solution  (stationary  solution,  equilibrium  solution, 
P solution)  for  problem  (3  is  any  vector  x*  that  satisfies  the  following  P 


■■■ 


, , ii  ■iiiij^p  m.  ,.  --- ~’™_  _____  __  ^ — ' 
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optimality  conditions: 

x*ez  nc,  vqix*)^ 

and 

0 = (x*,  V^(x*)>. 

If  the  cone  Z is  actually  a vector  space,  then  \=ZX  and  hence  the  P 
optimality  condition  0 = (x*,  V g (x*))  is  redundant  and  can  be  deleted. 
Furthermore,  in  the  ordinary  programming  case,  the  vector  space  y = = [0 ) , 

so  the  remaining  P optimality  conditions  become  the  (more  familiar)  "or- 
dinary optimality  conditions" 

x*£C,  and  VgOc*)  = 0 . 

The  following  theoran  gives  two  convexity  conditions  that  guarantee 
the  necessity  and/or  sufficiency  of  the  P optimality  conditions. 

Theorem  1.  Under  the  hypothesis  that  g is  differentiable  at  x*, 

(i)  given  that  Z is  convex,  if  x*  is  an  optimal  solution  to  problem 
3 , then  x*  is  a critical  solution  for  problem  d (but  not  conversely) , 

(ii)  given  that  g is  convex  on  (l,  if  x*  is  a critical  solution  for 
problem  (7,  then  X*  is  an  optimal  solution  to  problem  3. 

Proof.  To  prove  part  (i),  first  recall  that  the  optimality  of  r*  implies 
that  x*  €Z HC.  Then,  notice  that  the  optimality  of  x*,  the  differenti- 
ability of  g at  X*.  and  the  convexity  of  Z imply  that  the  directional 


derivative 


r 
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(Vt?  (x*)  , x)  > 0 for  each  X^X. 

Likewise,  Che  optimality  of  x* , the  differentiability  of  g at  X*,  and  the 
observation  that  x*+s(-x*)€X  for  s^l  imply  that  the  directional  deri- 
vative 

(7j0c*),  - X*)  >0. 

Consequently,  7g  (x*)  6 % and  0 = (x*,  7 0 (r*)). 

Counterexamples  to  the  converse  of  part  (i)  are  numerous  and  easy  to 
construct.  In  fact,  the  reader  is  probably  already  familiar  with  counter- 
examples from  the  ordinary  programming  case. 

To  prove  part  (ii),  first  recall  that  the  convexity  of  Q and  the 
differentiability  of  g at  x * imply  that 

g(x)  ~p(x*)  > (7  g (x*) , x - x*)  for  each  x€C. 

Then,  notice  that  the  assumptions  0 = (x*,  7g  Or*))  and  7g(x*)€'ty  imply  that 

(7g(x*),  x - x*)  = {73  Or*) , x)  0 for  each  x^X. 

From  the  preceding  displayed  relations  we  see  that  g(x)  - g (x*)  >0  for  each 
Jf€3jnc.  Consequently,  the  assumption  x*€Zn£.  shows  that  X * is  optimal 
for  problem <7.  q.e.d. 

It  is  worth  noting  that  g is  differentiable  everywhere  for  most  of 
the  examples  given  in  section  2.1  of  [l] . Moreover,  X is  polyhedral  and 
hence  convex  for  each  of  those  examples,  and  g is  convex  for  important 
special  cases  of  each  of  those  examples.  Consequently,  the  P optimality 
conditions  frequently  characterize  the  optimal  solution  set  <J*  for  problem <7. 


J 


r 


3.  The  constrained  case.  To  extend  geometric  programming  by  the  explicit 
inclusion  of  (generally  nonlinear)  constraint  functions,  we  introduce  two 
nonintersecting  (possibly  empty)  positive-integer  index  sets  I and  J with 
finite  cardinality  o( 1)  and  o(J)  respectively.  In  terms  of  these  index 
sets  I and  J we  also  introduce  the  following  notation  and  hypotheses: 


(la)  For  each  k £ (0}bIUJ  there  is  a function  g^  with  a non- 
empty domain  C - E , and  there  is  a nonempty  set 
k n. 
k 

D.  SE  for  each  j € J . 

j "i 

(2a)  For  each  k(?(0]UIUJ  there  is  an  independent  vector  variable 

k 

x in  E , and  there  is  an  independent  vector  variable  K with 
nk 

components  for  each  J £ J. 

(3a)  x1  denotes  the  cartesian  product  of  the  vector  variables  x*-, 
i€I,  and  x^  denotes  the  cartesian  product  of  the  vector 
variables  x^ , j € J.  Hence,  the  cartesian  product  (x^,x*,xJ)  = x 
is  an  independent  vector  variable  in  E^,  where 

A 

n = n„  +2  n +£  n . 

I J 3 

(4a)  There  is  a nonempty  cone  XCE^. 

The  resulting  "geometric  programming  problem"  A is  then  defined  in  the 
following  way. 

Problem  A.  Consider  the  objective  function  G whose  domain 

C = {(x,K)  |xk€Ck,  k€[0]UI,  and  (xJ,  Kj)€cj,  j <E  J) 
and  whose  functional  value 


1 

1 
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' ere 


* i l 

C = ( (x^ , < ) | either  K = 0 and  sup  <xJ , dJ ) < + « , or 
J J J dj  € Dj 


K . > 0 and  x1^  6 K C , ) 
J J J 


and 


sup  <xj,dj)  if  K =0  and  sup  <xJ,dJ><-K° 


dJ  € D 


^ “ dJ€D. 


K g (xJ/K  ) if  K.  >0  and  xJ  g K C . 
J J J J J J 


Using  the  feasible  solution  set 

^ i 

S = { (x,K)  € C | x € X,  and  g^x  )S0,  i £ i}  , 


calculate  both  the  problem  infimum 

A 

c o=  inf  G(x,K) 

(x,K)  6 S 

and  the  optimal  solution  set 

A 

S*=  [(x,K)  € S | G(x,K)  =cp). 

A 

Needless  to  say,  the  unconstrained  case  occurs  when  I=J=0,  8q;Cq 

and  X=Z.  On  the  other  hand,  the  "ordinary  programming"  case  occurs  when 

J = 0, 

A 

n.  = m and  C = C for  some  set  C CE 
Tc  k o o « 


k€  {0}  UI, 
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and 


A 

X = column  space  of 


where  there  is  a total  of  l+o(I) 


identity  matrices  U that  are  mxm. 


In  particular,  an  explicit  elimination  of  the  vector  space  condition  x€X 
by  the  linear  transformation 


U 


shows  that  problem  A is  then  equivalent  to  the  very  general  ordinary  pro- 
gramming problem 


Minimize  g^  (z)  subject  to 
g.(z)sO  161 


z 6 C . 
o 


Our  optimality  conditions  for  the  preceding  problem  A utilize  the 
dual  cone 

Y = fy  6 E |0s(x,y)  for  each  x6X}, 
n 

whose  vector  variable  y has  the  same  cartesian-product  structure  as  the 
vector  variable  x.  They  are  stated  as  part  of  the  following  definition. 


DEFINITION:  A critical  solution  (stationary  solution,  equilibrium 

solution,  P solution)  for  problem  A is  any  vector  (x*,K*)  for  which  there 
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is  a vector  X*  in  E ,TN  such  that 

o(I) 

following  P optimality  conditions: 


(x*,K*)  and  X*  jointly  satisfy  the 


x*€  X, 


and 


where 


gt(x  1)  i 0 


* 

>0 


* 

Xigi(x  ) = o 


y*6  y, 

0 =<x*,y*> 


*0  A . *0, 
y = vgQ(x  >> 


and 


*i  A * *i 

y -Xivgj,(x  ) 


*i  A *4  * 

y = vgj(x  J/x  ) 


161, 

161, 

161, 


j 6 J, 


16  1, 


j 6 J. 


Needless  to  say,  if  the  cone  X is  actually  a vector  space,  then  Y = X1  and 
hence  the  P optimality  condition  0 = (x*,y*)  is  redundant  and  can  be 
deleted.  Furthermore,  In  the  ordinary  programming  case,  the  vector  space 

* = (y  6 E | y°  +2  y1  = 0 } , 

I 

so  the  remaining  P optimality  conditions  are  essentially  the  (more  familiar) 
"Kuhn-Tucker  optimality  conditions" 
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g^z*)  SO  i € I, 

X*  *0  i € I, 

x*gi(z*)=o  iei, 

and 

v gQ  (z*)  +s  X*  7 gt  (z*)  = 0 . 

On  the  other  hand,  the  following  Important  concept  from  ordinary  pro- 
gramming plays  a crucial  role  in  the  theory  to  come. 


DEFINITION.  For  a consistent  problem  A with  a finite  infimum  CD,  a Kuhn- 


Tucker  vector  is  any  vector  X*  in  E 


o(I) 


with  the  two  properties 


* 

xt  *0 

and 


i€I, 


co  = inf  L (x,K;X*), 
(x,K)  €C  ° 
x € X 


where  the  (ordinary)  Lag rang ian 

A i 

Lq (x, K ; X)  =G(x,tC)  +Z  Xigi(xi). 

It  is  important  to  realize  that  the  preceding  definition  of  Kuhn-Tucker 
vectors  differs  considerably  fvom  the  widely  used  definition  involving 
the  Kuhn-Tucker  optimality  conditions.  Even  in  the  ordinary  convex  program- 
ming case  the  two  definitions  are  not  equivalent,  though  it  is  well-known 
that  the  preceding  definition  simply  admits  a somewhat  larger  set  of  vectors 
in  that  case. 

The  following  theorem  gives  two  convexity  conditions  that  guarantee 
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the  necessity  and/or  sufficiency  of  the  P optimality  conditions. 

*k 

Theorem  2.  Under  the  hypotheses  that  gk  is  differentiable  at  x , 

+ * 

k€{o}llIUJ  and  that  g ^ is  differentiable  at  (x  “ , < , j € J* 

(i)  given  that  X is  convex,  if  (x*,K*)  is  an  optimal  solution  to 
problem  a and  if  X * is  a Kuhn-Tucker  vector  for  problem  A,  then  (x*,K*) 
is  a critical  solution  for  problem  A relative  to  X*  (but  not  conversely) , 
(ii)  given  that  g^  is  convex  on  C^,  k£(0)UIUJ,  if  (x*, K*)  is 
a critical  solution  for  problem  A relative  to  X*,  then  (x*,K*)  is  an 
optimal  solution  to  problem  A and  X * is  a Kuhn-Tucker  vector  for  problem 
A. 


Proof.  To  prove  part  (i),  first  recall  that  the  optimality  of  (x*, K*) 

*i 

implies  that  x*  € X and  that  g^(x  )^0,  i£I.  Then,  note  that  the  defining 

* 

properties  for  a Kuhn-Tucker  vector  X * assert  that  \^SrO,  i€I  and  that 
_ * *1 

cp  £ G(x*,K*)  +2^  X,g.(x  ).  Since  cp  = G(x*,  X*)  , the  preceding  inequalities 
I 11 

A * *£ 

collectively  imply  that  X^g^(x  ) =0,  i€l;  from  which  we  infer  that 

L (x*,K*;X*)=  inf  L (x,K;X*). 

° (x,K)  € C ° 

x € X 

Since  our  hypotheses  clearly  imply  that  Lq(  • ,x;X*)  is  differentiable  at 
(x*,K*),  the  preceding  equation  and  the  convexity  of  X imply  that  the 
directional  derivative 

<VxLo(x*,K*;X*),  x)>0  for  each  x£X. 

Likewise,  the  differentiability  of  Lq(  • ,X;X*)  at  (x*,K*),  the  preceding 
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equation,  and  the  observation  that  x*  + s(-x*)€X  for  ssl  imply  that  the 


directional  derivative 


< V L (x*,K*;X*),  - x*>  ^0. 
X o 


Consequently,  V Lq (x*,  K*;  X*)  € Y and  0 = (x  , VxLq (x*,  K*\  X*) ) , which  means 
that  y*£Y  and  0 = (x*,y*).  Finally,  since  our  hypothesis  that  g*  is 
differentiable  at  (x  .K^),  -1  clearly  implies  that  > 0 , the  differenti- 

ability of  Lq(  • ,x; X*)  at  (x*,K*)  and  the  preceding  displayed  equation 
imply  that 

7.L  (x*,K*;X*)  =0, 
it  o 

which  means  that  (x  J ,y  J)  = (x  J , K ^ ) , j 6 J« 

Counterexamples  to  the  converse  of  part  (i)  are  numerous  and  easy  to 
construct.  In  fact,  the  reader  is  probably  already  familiar  with  counter- 
examples from  the  ordinary  programming  case. 

To  prove  part  (ii),  first  observe  that  Lq(  • ,x;X*)  is  convex  on  C and 

that  L ( • ,x;X*)  is  differentiable  at  (x*,K*),  These  two  observations 
o 

together  imply  that 

Lo(x,K;X*)  - Lq(x*,K*;X*)  5 <v(x>K;)L0<x*« **;**>>  <***>  - (x*,K*)> 

for  each  (x, K)  f C. 

*«  + *1  * 

Since  the  assumption  that  (x  ,y  ) = g j (x  , j £ J simply  means  that 
7„L  (x*,*C*; X*)  = 0,  elementary  linear  algebra  shows  that 

K O 

<V(x,iC)Lo(x*,K*;X*)’(x.K)  - (x*.'c*)>  = <Vo(x*,K*;X*)’  x'x*>  for  each  (x»K)* 

Since  it  is  clear  that  7xLo(x*,K*;X*)  =y*,  the  assumptions  that  0 = (x*,y*) 
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1 


and  y*  € Y imply  that 

(7xlq(x*>  **;  ^*) » x-x*)  = (V^L^Cx*,  K*;  \*)  , x)  ^ 0 for  each  (x,K)  for  which  x€X. 

From  the  preceding  displayed  relations  we  infer  that 

Lo(x,K;\*)  - Lo(x*,K*;\*)  >0  for  each  (x,K)€C  for  which  x € X. 

Using  this  inequality  and  the  assumption  that  X^g^(x  )=0,  i 6 I,  we  see 
that 

G(x*, K*)  s G(x, K)  +ZI  X.g.  (x*)  for  each  (x,K)€C  for  which  x€X. 

I 11 

•ff 

On  the  other  hand,  the  assumption  that  X^iO,  i€l  guarantees  that 

G(x,K)  +E  X1g1(xi)sG(x,<)  for  each  (x,K)€C  for  which  g^x^sO,  i€I. 
From  the  preceding  two  displayed  inequalities  we  infer  that 

G(x*,  K*)  s G(x,  K)  for  each  (x.lC)fS. 

Consequently,  the  assumptions  that  g^x^JsO,  i6I  and  that  x*€X  imply 
that  (x*,K*)  is  optimal  for  problem  A,  which  means  of  course  that  cp  = G(x*,K*). 
Using  these  facts  and  the  assumption  that  X*g1(x*1)  =0,  i€I,  we  infer  from 
the  last  displayed  inequality  involving  Lq  that  X*  is  a Kuhn-Tucker  vector 
for  problem  A.  q.e.d. 

It  is  worth  noting  that  for  most  of  the  examples  given  or  alluded  to 
in  section  2.2  of  [l]  the  gk>  k€  {0}  UI  U J are  differentiable  everywhere 
while  either  J is  empty  or  the  gj,  j € J are  differentiable  everywhere 
except  at  the  origin.  Moreover,  X is  polyhedral  and  hence  convex  for  each 
of  those  examples,  and  the  g^,  k€[0}UIUJ  are  convex  for  important 


special  cases  of  each  of  those  examples.  Consequently,  the  P optimality 
conditions  frequently  characterize  the  optimal  solution  set  S*  for  problem 
A. 

Characterizations  of  S*  that  do  not  require  differentiability  of  the 
g^.  k€fo}l)IUJ  and  the  g^ , j€J,  but  do  require  conjugate  transform 
theory  are  given  in  [2]. 
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